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Abstract 

Correlations between random variables play an important role in applications, 
e.g. in financial analysis. More precisely, accurate estimates of the correlation be- 
tween financial returns are crucial in portfolio management. In particular, in pe- 
riods of financial crisis, extreme movements in asset prices are found to be more 
highly correlated than small movements. It is precisely under these conditions that 
investors are extremely concerned about changes on correlations. We propose a bi- 
nary segmentation procedure to detect the number and position of multiple change 
points in the correlation structure of random variables. It is shown analytically that 
the proposed algorithm asymptotically gives the correct number of change points 
and the change points are consistently estimated. It is also shown by simulation 
studies and by an empirical application that the algorithm yields reasonable results. 
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1. Introduction and Summary 



There are many empirical hints that the correlation structure of financial returns of all 



sorts cannot be assumed to be constant over time, see e.g. Longin and Solnik (1995) 



and Krishan et al. (2009). Especially in times of crisis, correlation often increases, a 



phenomenon which is referred to as "Diversification Meltdown" (Campbell et al. 2008). 



Recently, Wied, Kramer and Dehling (2011) proposed a CUSUM type procedure adapting 



ideas by Ploberger et al. (1989) to formally test if correlations between random variables 



remain constant over time. Given a bivariate series of random variables, denoted by 
(X t ,Y t ), the fluctuation test uses the test statistic 



Q T {X, Y) = D max 



J 



2<j<T y/f 



\Pj - PT\ , 



(i; 



where pj is the empirical correlation up to time j, for j = 2, . . . , T and D is a normalizing 
constant which is described in Appendix A.l. The null hypothesis of constant correlation 
is rejected when the test statistic becomes too large, i.e. when the estimated correlations 
fluctuate too much over time. However, with this approach the practitioner is just able 
to see if there is a change or not; he cannot determine where a possible change occurs or 
how many changes we have. 

The present paper fills this gap by proposing an algorithm based on the correlation con- 
stancy test to estimate the number of change points and the time of the chang es as a 
fraction in the interval [0,1]. For this purpose, we adapt a method for estimation of 



multiple breaks which was originally proposed by Vostrikova (1981) and dealt with or 



implemented in various problems by Inclan and Tiao (1994), Bai (1997), Bai and Per 



ron 



(1998), Andreou and Ghysels (2002), Galeano (2007) and Galeano and Tsay (2010), 



among others. The segmentation algorithm proceeds as follows: First, we want to find the 
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(what we will later call) dominating change point and decide if this point is statistically 
significant. Second, we split the series in two parts and look for possible change points 
again in each part of the series. The procedure stops if we do not find any new change 
point any more. In this paper, we will analytically show that the algorithm asymptot- 
ically gives the correct number of change points and the change points are consistently 
estimated, assuming that there exists a finite number of change points. Furthermore, we 
will show that the algorithm gives reasonable results in simulations with finite samples 
and in an empirical application. 

The rest of the paper is organized as follows. Section 2 introduces the proposed procedure. 
In Section 3, we derive the asymptotic properties of the procedure. In Sections 4 and 5, 
we present some simulation studies and a real data application, respectively. Section 6 
provides some conclusions. All proofs are presented in the Appendix. 

2. Algorithm for the detection of change points 

In this section, we present the algorithm for detection of change points in random vari- 
ables. It is made for a bivariate series of random variables and can be applied to general 
sequences of random variables with some serial dependency. 

To be more precisely, let (X t ,Y t ),t £ Z, be a sequence of bivariate random variables 
with finite first four moments and let 1, ... , T be the observation period. Denoting the 
correlation between X t and Y t with 

= Cov(X u Y t ) 
Pt y/Var(X t )y/Var(Y t y 

the test problem is 

H Q : p 1 = . . . = p T vs. Hi : 3t £ {1, . . . , T - 1} : p t ^ p t +i- 



Wied et al. 



(2011) show that the asymptotic null distribution of the change point test 



for this test problem under certain assumptions is the supremum over the absolute value 
of a Brownian Bridge. In this paper, we basically focus on the test's behavior under the 
alternative. 

Two crucial assumptions for our procedures are a boundedness assumption and a serial 



dependence assumption. We basically use the same assumption as (A2) and (A3) in Wied 



et al. (2011), this means 



Assumption 1. The r-th absolute moments of the components ofUt = (Af, Yf, X t , Y t , X t Y t ) 
are uniformly bounded for some r > 2. 

Assumption 2. The vector (X t ,Y t ) is L 2 -NED (near-epoch dependent) with size — (r — 
l)/(r — 2), where r is from Assumption^ and constants (ct),t G TL, on a sequence 
(V t ),t G Z, which is a-mixing of size (ft* := — r/(r — 2), i.e., 

\\{X U Y t ) - E((X t , Y t )\a{V t „ m , . . . , V t+m ))\\ 2 < c t v m 

with linim^oo v m = 0, such that c t < 2\\U t \\2 with U t from Assumption^ and the L 2 -norm 

We assume that there is a finite number of change points. However, as usual in practice, 



the number, location and size of the change points are unknown. Wied et al. (2011) allow 
for some fluctuations in the first and second moments under the null hypothesis, compare 
their assumption (A4). However, for ease of exposition we focus on the case in which 
expectations and variances are constant under the alternative. It would be possible to 
extend the framework to changing expectations and variances, but this would only change 
the proofs, not the procedure itself. 
The formal assumption is: 

Assumption 3. Under the alternative, expectations and variances are constant and equal 
to fi x , ii y , and a 2 y , the second cross moment changes from E(Ajl^) = m xy to E(XjYJ) = 
m xy + g ) ■ The function g(z), z G [0, 1] is a step function with a finite number of steps 



b, i.e. there is a partition = Sq < si < • • • < s?, < Sfe+i = 1 such that 

b 

9{z) = ^2ail{ze[s t ,s l+1 )} 



i=0 



and g{l) = a^. 

The function g is the function that gives information about the time and height of the 
correlation jumps. Using this expression, which is similar to the expression in the local 



power analysis of Wied et al. ( 2011 ) , we can describe the jumps in an elegant way. For ease 
of exposition, we assume m xy to be 0. Then, for instance, assuming that m x = m y = 
and a 2 x = <j 2 y = 1, if, say, the correlation is equal to 0.5 in the first half of the sample, 
jumps to 0.7 in the middle of the sample and falls down to 0.6 after the third quarter of 
the sample, the function g would be 



g(z) = 0.5 • l{ ze [o,o.5)} + 0.7 ■ l{ z£ [o.5,o.75)} + 0.6 ■ l{ z6 [o.75,i]}- (2) 

The condition on the finiteness of the steps of g ensures that there is a finite number 
of change points b under the alternative. Our goal is to estimate b, the values Si, . . . , s& 
and the values ao, . . . , a&. For that, we propose a binary segmentation type algorithm for 
estimating the number, location and size of multiple correlation changes. Thereby, we 
concentrate on the "change point fractions" in the interval [0, 1]. If e.g. a break occurs 
in the middle of the sample, the value of interest would be 0.5 and not T/2. 
The main idea behind the procedure is to isolate each change point in different time 
intervals by splitting the two series into two parts once a change point is found. Then, 
the search of a new change point is initialized in both pieces. The proposed procedure for 
detecting correlation changes essentially bases on the intuitive estimator of the change 
point fraction. To obtain it, note that we can write 

* T ( z ) 

Q T {X,Y) = sup D—f= \p T{z) - pr\ 
ze[o,i] vT 
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with t(z) = [2 + z(T — 2)]. Then, in the first step of the procedure, the estimator is the 
value of z which maximizes the function 

" T ( z ) 

B T {z) := D-j= \p T{z) -pr\ . 

Note that B T (z) is calculated based on all observations. In the next steps of the algorithm, 
we just consider the observations in the relevant part of the sample. 
The algorithm proceeds as follows: 

Change detection procedure 

1. Let X t and Y t be the observed series. Obtain the test statistic Qt(X,Y). There 
are two possibilities: 

(a) If the test statistic is statistically significant, i.e., if Qt(X,Y) > c T , a , where 
CT, a is the asymptotic critical value for a given upper tail probability, then a 
correlation change is detected. Let z\ be the point at which the function Bt(z) 
attains its maximum value and go to step 2. 

(b) Otherwise, there are no correlation changes and the procedure ends. 

2. Let zi, . . . , zg be the £ change points already detected in previous iterations sorted 
in increasing order. Repeat this step until, 

max {Qt(X, Y), k = 1, . . . ,£ + l} < c T , a , 

k 

where Q^(X, Y) is the value of the statistic Qt(X, Y) obtained for values of t(z) 
with z in the interval z k _i + 1/T < z < Zk, for k — 1, . . . , £ + 1, taking z = and 
ze+i = 1- 

3. Let (zi, . . . , zg) be the vector of detected change points sorted in increasing order. 
If £ > 1, refine the estimate of the location of the change points by calculating the 
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statistic Qt(X,Y) for values of t(z) with z in the subintervals z^-i + 1/T < z < 
Zk+i, for k = 1, . . . , £, where zq = and Z£ + i — 1. If any of the change points is not 
statistically significant, delete it from the list, and repeat this step. 

4. Finally, the correlation between X t and Y t is estimated in the intervals between 
change points. 

Some comments on the proposed procedure are in order. First, the key point of the 
proposed procedure is that it detects a single change point in each iteration, which may 
not be the most efficient way to detect correlation changes when multiple changes exist. 
However, our theoretical results show that the procedure consistently detects the true 
change points. Moreover, the proposed procedure works well in small samples in terms 
of detection of the true number of changes as shown in the Monte Carlo experiments of 
Section 4. Second, step 3 is included to refine the estimation of the change points. Note 
that in this step, the procedure computes the value of the Qt(X, Y) statistics in intervals 
that are only affected by the presence of one change point, something not achieved in 
step 2. Third, the main objective of the proposed procedure is to identify points at 
which further attention is needed. Thus, if the number of change points detected is 
large compared with the sample size, then a piecewise constant correlation may not be 
a good way to describe the correlation between the two series. Fourth, although our 
theoretical results are shown assuming that the critical value used in the procedure tends 
to infinity with the sample size, in practice, we use different critical values in each step of 
the procedure. Using the same critical level in steps 2 and 3 may lead to over-estimation 
of the number of change points, because more tests are performed in each iteration as 
the number of detected change points increases and the type-I errors accumulate. Then, 
to avoid this multiple-test problem we assume that the type-I errors used depend on the 
number of change points already detected by the algorithm. In particular, in step 1 we 
use an initial type-I error such as «o = 0.05^] Then, after detecting the (£ — l)-th change 
1 See also the discussion on the error levels before Theorem [2] 
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point, we use the critical value cy )Qf , where ctg is given by 1 — a = (1 — ag) e+1 . This 
choice of otg is taken to maintain the same significance level for all tests. Finally, we use 
the quantiles of the distribution of the supremum of the absolute value of a Brownian 
Bridge (the limit distribution of the correlaton test statistic under the null hypothesis) in 
order to apply the procedure in practice. The explicit form of this distribution function 
is 

x < 



see also Billingsley (1968), p. 85. For example, for «o = 0.05 the critical value is 1.358. 



3. Asymptotic results 

In this section, we show that the algorithm proposed in Section [2] works. To this end, we 
maintain another assumption which guarantees that we do not have two or more change 
points with "equal form", i.e. we assume that there are always change points which 
dominate others. 

Assumption 4. Let < l\ < l 2 < 1 be arbitrary. The function g from Assumption^ is 
such that the function 



A*(z) :-- 



g(t)dt-z / g(t)dt 



z e [h,l 2 ] 



is either constant or has a unique maximum. 



A dominating change point is then defined as the argmax of A* (z) in a given interval 
[Zi,^]- We illustrate Assumption [4] in the case of example rt2J). On the interval [0, 1] for 



S 



example, the function A*(z) then looks like 



A*(z) = ( 



0.075 ■ z when z < 0.5, 

0.1-0.125-^ when 0.5 < z < 0.75, 
0.025 - 0.025 • z when 0.75 < z < 1 



and has a unique maximum at z — 0.5, i.e. the point with the "strongest" correlation 
change, see also Figure [T] 

Figure [T] around here 

In general, the height and the position of the change point decides if it is dominant or 
not. Assumption [4] is violated if the correlation jumps at equal sizes at symmetric time 
points, e.g. in the case 



g{z) = 0.5 ■ l{^ e [0,0.25)} + 0.7 • l{ze[o.25,0.75)} + 0.5 ■ l{ 2e [0.75,l]}' 



(3) 



A*(z) 



because here, 

0.1 • z when z < 0.25, 

0.05-0.1-z when 0.25 < z < 0.5, 
O.l-z-0.05 when 0.5 < z < 0.75, 
0.1-0.1-2 when0.75<2<l 
has two non-unique maxima in z = 0.25 and z = 0.75, see also Figure [2j 



Figure [2] around here 



Finally, we need a rather technical assumption regarding the normalizing constant D. 
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Assumption 5. Consider an arbitrary interval [Zi,^] Q [0,1] with l\ < l 2 and let D be 
the estimator from Appendix A.l calculated from data from r(/i) to r^)- Then, under 
the null and the alternative, D converges to a real number D\ G (0, oo). 

For the interval [0,1], this assumption is for example fulfilled in a simple model with 
a bivariate normal distribution, serial independence, constant expectations 0, constant 
variances 1 and a change in the covariance in the middle of the sample from p\ to p 2 - In 
this case, it is easy to see that D\ is equal to {DcD^DaD^D'^ 1 / 2 with 
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I-Ph) 



'1/2 



and this is a real positive number e.g. for pi = —p 2 - 

The next theorem shows that the change point estimator is consistent if it is known a 
priori that there is a change point in a given interval. 

Theorem 1. Let Assumptions^ [1| [3| [7] an<i[3] be true and let there be one or several break 
points in a given interval [lijh] Q [0, 1] with l\ < l 2 - Then the change point estimator is 
consistent for the dominating change point. 

While also of interest in its own, the preceding theorem is mainly needed for the next 
theorem yielding the convergence of the algorithm. We require one additional assumption 
on the critical value Or,a- While we argued in the preceding section that we have to adjust 
the value for finite T due to multiple testing problems, we need another kind of assumption 



for the asymptotics as T — > oo. This assumption is basically the same as Bai (1997) also 
uses (Proposition 11). 
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Assumption 6. The critical value CT, a used in the algorithm fulfills the condition 
liniT-s-oo CT, a = oo and cr, a = o(y/T). 

The assumption rules e.g. out a choice of initial type-I error such as ao = 0.05 because 
the initial type-I error must converge to 0. However, it is legitimate using a fixed type-I 
error in finite samples if we consider an upper bound for T. The initial type-I error 
a = 0.05 then corresponds to ct )0 = 1.358, see the discussion before Section 3. 

Theorem 2. Under Assumptions^ [£[ [3| [5| an<i[6| the change point algorithm asymp- 
totically gives the correct number of change points b and the change points are consistently 
estimated. 

4. Monte Carlo experiments 

In this section, we carry out several Monte Carlo experiments to gain insight into the 
finite sample performance of the proposed procedure. In particular, we study several 
aspects, including the size (type-I error) of the procedure, the power of the procedure in 
correct detection of the changes and the ability of the procedure to accurately identify 
the location of the change points. 

First, we check the size (type-I error) of the procedure. For that, we consider a vector 
autoregressive of order 1 given by: 



fx, 


\ 


4* 




fx 




+ 


(A 


u 


/ 




4.) 




J 




\4 



where (ej,ef) are iid bivariate t& distributed with correlation p. The Student-t distribu- 
tion is considered to make our data long tailed. Three values of the correlation parameter 
p are considered, p = —.5, and .5. Two values of the parameter <ft are considered, = 0, 
that represent the case in which X t and Y t are iid observations, and <ft — -5, which repre- 
sents the case of temporal dependency. The sample sizes considered in the experiments 
are T = 200, 500, 1000, 2000 and 3000, that are usual in finance. Table [T] gives the 
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results based on 1000 replications and an initial nominal significant level of = 0.05. 
From this table, it seems that the type-I error of the proposed procedure is very close 
to the initial nominal level even with the smallest sample size. Therefore, overdetection 
does not appear to be an issue for the proposed procedure if there are no changes in the 
correlation. 



Table [T] around here 



Next, we analyze the power of the proposed detection procedure when there is a single 
change point in the series. The Monte Carlo setup is similar to the one described previ- 
ously, but the series are generated with a single change point in the correlation. Three 
locations of the change point are considered, z\ = 0.25, 0.50 and 0.75. Thus, for each 
sample size T, the time points of the change are t\ = [0.25T], [0.50T], and [0.75T], re- 
spectively. The change is such that the correlation of the series before the change point 
is po = .5 and then changes to p\ = 0, .25 or .75. The value of pi — represents a big 
change in correlation while the values p\ — .25 and .75 represent moderate changes. The 
results are shown in Table [2] Some comments on the table apply. First, the procedure 
performs quite well in detecting a single change point, with many cases over 90% correct 
detection. Second, as the sample size increases and the size of the change gets larger, the 
procedure works better. However, the magnitudes of the exception are small in general. 
Third, when the sample size of the change is small, the probability of under-detection 
may be large. Fourth, the power of the procedure is larger if the correlation coefficient 
increases than if the correlation coefficient decreases, see the second and third part of 
the table for the cases p\ = .25 or .75. This is particularly so when the sample size 
is small. Fifth, the location of the change point does not strongly affect the detection 
frequency of the procedure when the sample size is large. However, if the sample size 
is small then the procedure detects more frequently the change point at the middle of 
the series. Finally, in most cases, the percentage of false detection is smaller than the 
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nominal 5%. In particular, the frequency of over-detection is small for all cases. On the 
other hand, Table [3] shows the median and mean absolute deviation of the change point 
estimators in each case. The median of the estimates are quite close to the true change 
point locations. Note that the larger is the size of the change, the better is estimated its 
location. 

Table [2] around here 
Table [3] around here 

Next, we conduct another Monte Carlo experiment to study the power of the proposed 
procedure for detecting two change points. In this case, the location of the change 
points considered are Z\ = 0.33 and z 2 = 0.66. Thus, the time point of the changes are 
t\ = [0.33T] and t 2 = [0.66T], respectively, for each sample size T. Three situations are 
considered. First, the changes are such that the correlation of the series before the first 
change point is po = -5, then changes to p\ — at the second change point, and, finally, 
changes to p 2 = .25. Second, the correlation of the series before the first change point 
is po — -5, then changes to pi = .25 at the second change point, and, finally, changes 
to P2 = .75. Third, the correlation of the series before the first change point is po = .5, 
then changes to pi = .75 at the second change point, and, finally, changes to p 2 = .25. 
The results are shown in Table |4} As in the case of a single change point, the proposed 
procedure works reasonably well, especially when the sample size is large or the size of the 
correlation change is large. In addition, the procedure does not overdetect the number 
of change points. It may underestimate the number of change points, however. The 
underestimation can be serious when the sample size is small, say T = 200. Finally, the 
percentage of false change points detected in both cases, one and two change points, is 
smaller than the nominal 5% in almost all the cases. On the other hand, Table [5] shows 
the median and mean absolute deviation of the estimates of the change point locations. 
Note that the medians of the estimates are quite close to the true ones. Also, it appears 
that the larger is the size of the change, the better is estimated its location. 
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Table [4] around here 
Table [5] around here 



5. Application 

In this section, we look for changes in the correlation structure of the log-return series 
of two U.S. assets: the Standard & Poors 500 Index and the IBM stock from January 2, 
1997 to December 31, 2010 consisting of T = 3524 data points. Both log-returns series 
are plotted in Figure [3j which shows different volatility periods. The empirical correlation 
of both log-returns is given by 0.6225. The autocorrelation functions of the log-returns 
show some minor serial dependence, while the autocorrelation functions of the squared 
log-returns shows serial dependency, as usual in stock market returns. 

Figure [3] around here 

Next, we apply the proposed segmentation procedure of Section 2 to detect correlation 
changes for the log-returns of the S&P 500 and IBM stock assets. Tableland Figures|4| [5j 
[6]and [7]show the iterations taken by the procedure. Similar to the simulation experiments 
of Section |4j we start with the asymptotic critical value at the 5% significance level. In 
the first iteration, the procedure detects a change in the correlation at time point t = 988 
(November 29, 2000). Indeed, as shown in Figure [4] there are two local modes of the 
CUSUM statistic. The value of the test statistic (jl]) is 1.5699, which is significant at 
the 5% level. Following the proposed procedure, we split the series into two subperiods 
and look for changes in the subintervals [1,988] and [989,3524], respectively. In the first 
subinterval (see Figure [5]), the procedure detects a change at time point t = 664 (August 
19, 1999). The value of the test statistic is 2.1009. Then, we split the subinterval [1, 988] 
into two subintervals and look for changes in the subintervals [1,664], [665,988] and 
[989, 3524] (see Figure [6]). No more changes were found in these three subintervals. Then, 
we pass to step 3 and refine the search. For that we estimate the location of the change 
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points in the intervals [1,988] and [665,3524], respectively. In the first subinterval (see 
Figure [7]), as in the previous step, the procedure detects a change at time point t = 664 
(August 19, 1999) and the value of the test statistic is 2.1009. On the other hand, in the 
second subinterval (see Figure [7]), as in the previous step, the procedure detects a change 
at time point t = 2734 (November 12, 2007) and the value of the test statistic is 1.6193. 
These are the finally estimated change points. 

Table [6] around here 
Figure [4] around here 
Figure [5] around here 
Figure [6] around here 
Figure [7] around here 

The empirical correlation coefficients in the three subintervals are 0.6284, 0.5785 and 
0.7823, respectively, indicating that the correlation shifted to a smaller value after the 
first change point and to a higher value after the second change point. Figure [8] shows the 
scatterplots of the two log-returns indexes at three different subperiods. It is interesting 
to see that the dates of the detected change points fare well with well known financial 
facts. The period starting at 1994 till the end of 1999 is a period of economic growth 
in the U.S. economy in which the inflation was under control and the unemployment 
rate dropped to below 5%. This is a period with high increases in the stock markets. 
However, the collapse of the dot-com bubble started at the end of the 1990s and the 
beginning of the 2000s, and the market gave back around the 75% of the growth obtained 
in the 1990s. However, note that, contrarily to the diversification meltdown theory, the 
correlation did not increase during the dot-com bubble crisis. The third estimated change 
point roughly corresponds to the beginning of the Global Financial Crisis around the end 
of 2007, which is considered by many economists the worst financial crisis since the Great 
Depression of the 1930s. The reduction of interest rates leads to several consequent 
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issues starting with the easiness of obtaining credit, leading to sub-prime lending, so that 
a increased debt burden, and finally a liquidity shortfall in the banking system. This 
resulted in the collapse of well known financial institutions such as Lehman Brothers, 
Merrill Lynch, Washington Mutual, Wachovia, and AIG, among others, the bailout of 
banks by national governments such as Bear Stearns, Citigroup, Bank of America and 
Northern Rock, among others, and great loses in stock markets around the world. In this 
case, the Global Financial Crisis produced an increase in the correlation between both 
log-returns. Of course, it is important to note that these economic interpretations are 
mere speculations. These comments only point out that, for this particular example, the 
proposed detection procedure in Section 2 identifies changes in the correlation structure 
that fare well with well known events affecting the U.S. financial market. 

Figure [8] goes around here 



6. Conclusions 

In this paper, we have proposed a binary segmentation detection procedure for change 
points in the correlation structure of random variables. As far as we know, this is the first 
procedure for solving such a problem. The procedure is based on a CUSUM test statistic 



proposed by Wied et al. (2011). The asymptotic distribution of the test coincides with 
the one of the supremum of the absolute value of a Brownian Bridge in the interval [0, 1]. 
We have shown that the proposed procedure is consistent to detect the true number 
of location of the change points. Also, the finite sample properties of the procedure 
have been analyzed by the analysis of several simulation studies and the application of 
the procedure to a real data example. The empirical findings in the real data example 
suggest that the procedure detects changes in situations in which the relationship between 
financial returns may change due to financial crisis. 
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A potential drawback of our procedure is the fact that it is designed for a bivariate 
vector, but not for a higher-dimensional correlation matrix. Of course it is possible to 
consider each entry in the correlation matrix separately to determine when there have 
been changes in the individual correlations. It might be an interesting task for further 
research to do this in a more sophisticated way. 
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A. Appendix section 



A.l. The scalar D from the test statistic ([!]) 

The scalar D from our test statistic Qt{X,Y) based on all observations can be written 



as 



D = (F 1 D 3 , 1 + F 2 D 3 , 2 + F 3 D 3t3 )-i 



where 
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E u = Di,u - 4^^1,13 + 4//^ 1)33 , 

= #21 = -Di,i2 — 2/i x -Di i2 3 — 2/tj / _D lil4 + 4jl x jlyD ltM , 

^13 = ^31 = — As/-^1,13 + 2>P>xP>yDi t 33 — faD^u + 2/j^Z) 1i34 + Z) ljl5 — 2jl x D lj35 , 
E 23 = E 32 = —p>yDi t 23 + ZftxP-yD^U ~ + 2/t 2 ) -Dl j 34 + -Dl,25 ~ 2/1^/^1,45, 

-^33 = + ^P'xP'yDi^ ~ IfayDi^ + f^D^u + -Dl,55 ~ 2p>xDi,45, 



D 1 





D hl2 


A.13 


£>1,14 


£>1,15 




-Dl,22 


-Dl,23 


-Dl,24 


A.25 


Dl,31 


-^1,32 


-^1,33 




A,35 




-Dl,42 




-^1,44 


-^1,45 



t=l u =l V ,J 



\-Dl,51 -Dl,52 -Dl,53 -Dl,54 ^1,55/ 



C/r,7T = [logT], 



1 — |x| < 1 



0, 



otherwise 



a x a y 
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and 



°l = {X 2 ) T - (X T ) 2 , d* = (F 2 ) T - {Y T )\ a xy = (XY) T - X T Y T . 



This is the same expression as in Appendix A.l in Wied et al. (2011). The estimator 
based on the relevant sub-sample is basically calculated in the same way, whereas the the 
respective interval length is taken into account. 

A.2. Proofs 

Proof of Theorem [7] 

We consider the interval [li, I2]. Denote r](z) = [1 + z(T — 1)] and 



X v{z) = V" x Y n{z) = - Y 



v >n(z) _ 1 YV 

Vih) ~ V(z)-v(h) + l . *7t , 



t=v(h) t=r)(h) 

T)(Z) 



t=v(h) 





p^C= , w _; ( , i)+1 e(^-^;' 



t=v(h) 



p n(h) ~ 



[VarAI-^IVarF]-,', 

Let D be the variance estimator from Appendix A.l calculated from the observations 
T](h) to 77(^2)- Obtaining the maximum of Bt{z) is equivalent to obtaining the maximum 
of \A T (z)\ with 



Ariz) := D 



20 



We first show that A T (z) converges in distribution to 



A(z) := C A ^jf g(t)dt -z£ g(t)<ttj 



uniformly in z G [h,^} with a constant Ca- For this purpose, write 



A (A - n ^)~^i) + 1 (*>(*) n \ n VW-VW + L (>n(h) \ 



^) -^l) + 1 /^(ia) 



with p 



a x <r y 



and consider an arbitrary e > such that l\ + e < l 2 . We thus have 



(z) ^v(z) 



A T (z) := D 



V(z) ~ V{h) + 1 / AY r,(h)- A v (h) y r,(h) m xy -n x Hy 



fw — r\/ v-i ^ 



r/(/ 2 ) v*j(J2)=i7»KJa) 



O x O y 



^ 77(g) -7/(Zi) + 1 f X ^r ; (/i) ~ ^aO^/l) "1*1/ - 
^ 2 )-^l) + l 



Trth) 



■rth) 



a x a y 



Straightforward calculations using the strong law of large numbers, Slutzky's theorem, 
the fact that 



sup 



= sup 

ze[h+e,l 2 ] 



1 



rtz) nz 

-— J2 ( X tYt ~ m xy ) - J g(t)dt 



t=v(h) 

rtz) 



1 y 



g(t)dt 



->■ 



and the fact that sup z ^ ll+e ^(i](z) — 77(^1) + 1) — > 00 yields 



A T (Z) ^ a . s . A{z) 



and 



\A T (z)\ \A(z) 



uniformly on [li + e, / 2 ] 
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Consider now the case e = 0. By the preceding calculations, we immediately get 



A T (z) -> a . a . A(z) 

uniformly on [Zi + e, Z2] for a fixed e > such that Zi + e < ^- Consider now the following 
functions: 

Ar(z), -2>/i + e 



AHz) 



Z x < 2 < l x + e 

A(z), z>k + e 

h < z < l x + e 



The previous results then imply that 
for T —7- 00 on [Zi + e, Z 2 ] and also 

A e (-)^ d A(.) 

for rational e — )■ 0. The convergence of Ar(-) on [Zi, Z2] then follows from Theorem 4.2 in 



Billingsley (1968) if we can show that 



limlimsupP( sup — At(z)\ > rj) = limlimsupP( sup > rj) = 

for all rj > 0. Note that the separability condition of this theorem is not necessary in our 
case, because for each interval / C [0, 1], sup zg/ is always a random variable when 

A(-) is a right-continuous random function. 
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Since 



limsupP( sup > 77) < limsupP( sup 

T-s>oo ze[h,h+e] T->oo 2;g [; li / 1 + e ] 



D2 



>v) 



we get 



and 



P( sup D A 2e > rf) 

ze[h,h+e] 



A T {Z) ^ a .s. A(z) 



\A T { Z )\ -+ a .,. \A(z)\ 

uniformly on [Zi, Z2] - 

With Assumptions [4] and [5j has a unique maximum m in the change point fraction. 

Let F the maximum of for z G [h, l-z\. Since > |A r (m)| we get stochastic 

convergence of F to m (compare the argument in 



Bai and Perron 



1998 



p.77). 



Proof of Theorem 
With Theorem 1, we get 

under the alternative. Since 



A T (z) -> a . a . A(z) 



Q T (X,Y) = Vf sup \A T {z) 



ze[o,i] 



we have 



Qt(X,Y) ^ p oo 



(4) 



for a? = o y v Tj under the alternative. With this argument (which is partially similar to 
Corollary 2 in Andrews |1993 ), one can adapt the proof of Proposition 11 of Bai (1997). 
Consider the event {b < b}. If the estimated number of change points b is smaller than b, 
there is at least one segment [§k, si] with s fc — Y p s fc and §1 — > p si such that there is another 
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change point s m G [sfc,sj]. Since P(Qt{X,Y) > a T ) — > 1 as T — > oo with Q, we have 
P(b < b) — > as T — > oo. Consider the event {b > b}. For this event to be true, there 
must be a false rejection of the null hypothesis at a certain stage in the segmentation 
procedure. If (sj, i = 0, . . . , b) are the true change points and (Sj, i = 0, . . . , b) are the 
corresponding consistent estimates, it holds 

P(b > b) < P(3i : the test based on data for r(z) with z G Si + i] rejects) 

b 

< P( the test based on data for t(z) with z G [sj, Sj+i] rejects). 

Since under the null hypothesis P(Qt(X, Y) > a T ) — > 0, it holds with the test statistic 
computed for t(z) with z G Q l T (X, Y), 



P(b >b)<(b + l) max P(Q^(X, Y) > a T ) -> 0. 

0<i<b 



Consequently, P(b < b) — > 1 for T — > oo. 

Combining the argumentation for the event {b < b} with Theorem [l] yields the proposed 
consistency results and the proof is completed. ■ 
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Figure 1: Function A*(z) in example ^ for z £ [0, 1] 




Table 1: Results for type I errors. 
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Figure 2: Function A*(z) in example ^ for z £ [0, 1] 
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Figure 4: First step of the procedure 

First iteration 
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Figure 6: Second step of the procedure (second iteration) 
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Figure 8: Scatterplots of the two indexes at three different subperiods 
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lable 6 


: Estimation of the change point. 
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Table 4: Results for two change points. 



= 


= 0.5 


(^2) = (1/3,2/3) 


O^) = (1/3,2/3) 


Rel. freq. 


Rel. freq. 



Po Pi Pi T 1 2 > 3 1 2 > 3 

200 .852 .134 .014 .000 .843 .131 .025 .001 

500 .458 .422 .117 .003 .529 .359 .107 .005 

.5 .25 1000 .130 .525 .339 .006 .169 .507 .306 .018 

2000 .018 .271 .702 .009 .017 .337 .627 .019 

3000 .003 .133 .844 .020 .007 .205 .761 .027 

200 .622 .352 .026 .000 .660 .286 .052 .002 

500 .240 .554 .199 .007 .314 .523 .147 .016 

.5 .25 .75 1000 .042 .456 .490 .012 .074 .500 .404 .022 

2000 .006 .159 .802 .033 .012 .217 .734 .037 

3000 .004 .062 .891 .043 .002 .091 .850 .057 

200 .699 .223 .078 .000 .711 .204 .080 .005 

500 .253 .313 .424 .010 .345 .328 .318 .009 

.5 .75 .25 1000 .061 .170 .753 .016 .089 .210 .670 .031 

2000 .010 .036 .932 .022 .007 .047 .903 .043 

3000 .001 .012 .956 .031 .005 .011 .940 .044 
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Table 5: Estimation of two change points. 
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Table 6: Iterations taken by the procedure in the real data example, (*) means significant 
change point. 



Step 1 


Interval 


Qt(X,Y) 


Change point 


Time point 


Date 


[1,3524] 


1.5699 (*) 


988 


0.2803 


November 29, 2000 


Step 2 


Interval 


Qt(X,Y) 


Change point 


Time point 


Date 


[1,988] 


2.1009 (*) 


664 


0.1884 


August 19, 1999 


[989, 3524] 


1.4744 


2966 


0.8416 


October 14, 2008 


[1,664] 


1.0482 


157 


0.0445 


August 14, 1997 


[665, 988] 


1.3470 


825 


0.2341 


April 7, 2000 


[989, 3524] 


1.4744 


2966 


0.8416 


October 14, 2008 


Step 3 


Interval 


Qt(X,Y) 


Change point 


Time point 


Date 


[1,988] 


2.1009 (*) 


664 


0.1884 


August 19, 1999 


[665, 3524] 


1.6193 (*) 


2734 


0.7758 


November 12, 2007 
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